Young Researchers Workshop —2024
The 2024 Young Researchers Workshop will be held on Cornell's Ithaca campus Wednesday, October 9 through Friday, October 11!
All participants will need to register for this workshop.
(You will find travel, hotel, and parking information below the schedule and presenter information.)
Schedule
Wednesday 10/9
6pm-9pm. Reception: Baker Portico, Physical Sciences Building, 245 East Avenue.
Very limited parking onsite.
Thursday 10/10
8.30-9:45am Sign-in, Breakfast and Poster Session 1: Duffield Hall Atrium, 343 Campus Road.
10:00am-11:00am Opening Session (G10 Biotech) Hosted by Adrian Lewis
- Yuchen Wu—Stochastic Runge-Kutta Methods: Provable Acceleration of Diffusion Models
- Yu Ma—M3H: Multimodal Multitask Machine Learning for Healthcare
11:00am - 11:30am Coffee Break
11:30am - 1:00pm Optimization I (G10 Biotech) Hosted by Soroosh Shafiee
- Xin Jiang—Graph sequences with finite-time convergence for decentralized average consensus and its application in distributed optimization
- Xiaopeng Li—Proximal random reshuffling under local Lipschitz continuity
- Tianjiao Li—A simple uniformly optimal method without line search for convex optimization
- Zikai Xiong—Enhancing the Solution Methods for Huge-Scale Linear Programming via Level-Set Geometry
1:00pm - 2:00pm Lunch (Biotech Atrium)
2:00pm - 3:30pm Reinforcement Learning (G10 Biotech) Hosted by Ziv Scully
- Yuchen Hu—Policy Evaluation in Dynamic Experiments
- Lucy Huo—Asymptotic product-form steady state for multiclass queueing networks in multi-scale heavy traffic
- Yashaswini Murthy—Learning and Control in Countable State-Space
- Ming Yin—Understanding Q*: from sequential decision making to Large language model alignment
3:30pm - 4:00pm Coffee Break
4:00pm - 5:30pm Markets and Decisions (G10 Biotech) Hosted by Paul Gölz
- Nicolas Christianson—Reliable AI-Augmented Algorithms for Energy and Sustainability
- Meena Jagadeesan— Safety vs. Performance: How Multi-Objective Learning Reduces Barriers to Market Entry
- Devansh Jalota—Algorithm and Incentive Design for Sustainable Resource Allocation: Beyond Classical Fisher Markets
- Jiaqi Zhang—Designing and learning from large-scale interventions
6:30pm – 9:00pm Dinner. Ithaca Downtown Conference Center, 116 East Green Street
Friday 10/11
8.30am-9:30am Breakfast and Poster Session 2: Duffield Hall Atrium, 343 Campus Road
9:30am - 9:45am Walk to Statler Hall, Room 265, 106 Statler Drive
9:45am-11am Optimization II (Statler Hall, Room 265) Hosted by Peter Frazier
- Yifan Hu—Contextual Stochastic Bilevel Optimization and Three-Stage Stochastic Programming
- Akshit Kumar—Dynamic Resource Allocation: Algorithmic Design Principles and Spectrum of Achievable Performances
- Xinyi Chen—A Non-stochastic Control Approach to Optimization
11:00am - 11:30am Coffee Break
11:30am - 12:45pm Causal Inference & Distribution Shift (Statler Hall, Room 265) Hosted by Christina Lee Yu
- Ayoub Foussoul—Distributionally Robust Newsvendor on a Metric
- Yihong Gu—Causality Pursuit from Heterogeneous Environments
- Reese Pathak—Towards optimal learning under distribution shift: new complexity measures and algorithms for tackling covariate shift
1:00pm – 2:00pm Lunch (available “to go”) Duffield Atrium
Presenter | Title |
---|---|
Jerry Anunrojwong | Battery Operations in Electricity Markets: Strategic Behavior and Distortions |
Manuel Arnese | Convergence of Coordinate Ascent Variational Inference for log-concave measures via optimal transport |
Sandeep Chitla | Consumers' Cart-Building Behavior in Online Grocery: Impact of Cart-Level Promotions |
Natalie Collina | Algorithmic Collusion Without Threats |
Anna Deza | |
Myungeun Eom | Batching and Greedy Policies: How Good Are They in Dynamic Matching? |
Andrei Graur | Sparse Submodular Function Minimization |
Feihong Hu | "Uber" Your Cooking: The Sharing-Economy Operations of a Ghost-Kitchen Platform |
Marouane Ibn Brahim | Maximum Load Assortment Optimization: Approximation Algorithms and Adaptivity Gaps |
Youngseo Kim | Estimate then Predict: Convex Formulation for Travel Demand Forecasting |
Thodoris Koukouvinos | A novel algorithm for a broad class of non convex optimization problems |
Eitan Levin | Any-dimensional optimization |
Brian Liu | FASTopt: An Optimization Framework for Fast Additive Segmentation in Transparent ML |
Calum MacRury | On Online Contention Resolution Schemes for the Matching Polytope of Graphs |
Jason Milionis | A Myersonian Framework for Optimal Liquidity Provision in Automated Market Makers |
Naren Manoj | On the Robustness of Spectral Algorithms for Semirandom Stochastic Block Models |
Laurel Newman | When Lines Become Contagious: Analyzing the Spread of Infection in the M/M/1 Queue |
Sloan Nietert | Robust Decision Making with Local and Global Adversarial Corruptions |
Haripriya Pulyassary | Network Flow Problems with Electric Vehicles |
Ayush Sekhari | Learning-Unlearning Schemes |
Richard Shapley | An additive approximation for unsplittable multicommodity flow in outerplanar graphs |
Devin Smedira | Deterministically Computing Volumes and Integrals in High Dimensions |
Asterios Tsiourvas | Overcoming the Optimizer’s Curse: Obtaining Realistic Prescriptions from Neural Networks |
Jie Wang | Regularization for Adversarial Robust Learning |
Guanghui Wang | Faster Margin Maximization Rates for Generic and Adversarially Robust Optimization Methods |
Priyank Agrawal | Optimistic Q-learning for average reward and episodic reinforcement learning |
Mallory Gaspard | Monotone Causality in Opportunistically Stochastic Shortest Path Problems |
Jaingze Han | Probabilistic design in the operation of service systems |
Boya Hou | Nonparametric Sparse Learning of Dynamical Systems |
Laixi Shi | Can We Break the Curse of Multiagency in Robust Multi-Agent Reinforcement Learning? |
Jonah Botvinick-Greenhouse |
Presenter | Title |
---|---|
Matheus Jun Ota | Benders cuts via corner polyhedra: an application to the stochastic vehicle routing problem |
Marios Papachristou | Optimal Resource Allocation for Remediating Networked Contagions |
Vrishabh Patil | Addressing Estimation Errors on Expected Asset Returns through Robust Portfolio Optimization |
Ali Shirali | Allocation Requires Prediction Only if Inequality is Low |
Man Yiu Tsang | On the Trade-off Between Distributional Belief and Ambiguity: Conservatism, Finite-Sample Guarantees, and Asymptotic Properties |
Kabir Verchand | Sharp guarantees for iterative nonconvex optimization with random data |
Chonghuan Wang | Adaptive Experimental Design In Operations |
Qi Wang | Stochastic Constrained Optimization |
Zhuoyu Xiao | Synchronous and Asynchronous Gradient-Response Schemes for Computing Quasi-Nash Equilibria under Uncertainty |
Kunhe Yang | Platforms for Efficient and Incentive-Aware Collaboration |
Wen Yun | Vague Pricing Optimization |
Anthony Karahalios | Column Elimination |
Yoav Kolumbus | |
Tu Ni | |
Abdellah Aznag | An active learning framework for multi-group mean estimation |
Qian Xie | Cost-aware Bayesian optimization via the Pandora's Box Gittins index |
Yifang Chen | Algorithmic data efficient learning in the era of large model |
Rares Cristian | Robust End-to-End Learning under Endogenous Uncertainty |
Haiyun He | Information-Theoretic Generalization Bounds for Deep Neural Networks |
Vikas Deep | Asymptotically Optimal Adaptive A/B tests for Average Treatment Effect |
Zhicheng Guo | Improving Clinician Performance in Classification of EEG Patterns on the Ictal-Interictal-Injury Continuum using Interpretable Machine Learning |
Su Jia | |
Sammy Khalife | Sample Complexity of Data-Driven Algorithm Design using Neural Networks |
Yueying Li | Sustainable LLM Lifecycle |
Varun Suriyanarayana | Online Generalized Flow and Load Balancing with recourse |
Zongyi Li | Neural operator for partial differential equations |
Zhou Lu | When is Inductive Inference Possible? |
Jennifer Sun | Online Control in Population Dynamics |
Yige Hong | Unichain and Aperiodicity are Sufficient for Asymptotic Optimality of Average-Reward Restless Bandits |
Taylan Kargin | Infinite-Horizon Distributionally Robust Regret-Optimal Control |
Minda Zhao | Landscape of Policy Optimization for Finite Horizon MDPs with General State and Action |
Abstracts
Optimization II—Hosted by Peter Frazier
Friday, October 11, 9:45 a.m.— 11:00 a.m. (Statler Hall, Room 265)
Yifan Hu—Contextual Stochastic Bilevel Optimization and Three-Stage Stochastic Programming
We introduce contextual stochastic bilevel optimization (CSBO) -- a stochastic bilevel optimization framework with the lower-level problem minimizing an expectation conditioned on some contextual information and the upper-level decision variable. This framework extends classical stochastic bilevel optimization when the lower-level decision maker responds optimally not only to the decision of the upper-level decision maker but also to some side information and when there are multiple or even infinite many followers. It captures important applications such as meta-learning, personalized federated learning, end-to-end learning, and Wasserstein distributionally robust optimization with side information (WDRO-SI). Due to the presence of contextual information, existing single-loop methods for classical stochastic bilevel optimization are unable to converge. To overcome this challenge, we introduce an efficient double-loop gradient method based on the Multilevel Monte-Carlo (MLMC) technique and establish its sample and computational complexities. When specialized to stochastic nonconvex optimization, our method matches existing lower bounds. Extending to three-stage stochastic programming, our results break the long-standing belief about three-stage stochastic programming is harder than classical stochastic optimization, and open up new directions for algorithmic design for three-stage problems.
Akshit Kumar—Dynamic Resource Allocation: Algorithmic Design Principles and Spectrum of Achievable Performances
We consider a broad class of dynamic resource allocation problems, and study the performance of practical algorithms. In particular, we focus on the interplay between the distribution of request types and achievable performance, given the broad set of configurations that can be encountered in practical settings. While prior literature studied either a small number of request types or a continuum of types with no gaps, the present work allows for a large class of type distributions. Using initially the prototypical multi-secretary problem to explore fundamental performance limits as a function of type distribution properties, we develop a new algorithmic property “conservativeness with respect to gaps,” that guarantees near-optimal performance. In turn, we introduce a practically-motivated, simulation-based algorithm, and establish its near-optimal performance, not only for multi-secretary problems, but also for general dynamic resource allocation problems.
Xinyi Chen—A Non-stochastic Control Approach to Optimization
Selecting the best hyperparameters for a particular optimization instance, such as the learning rate and momentum, is an important but nonconvex problem. As a result, iterative optimization methods such as hypergradient descent lack global optimality guarantees in general.We propose an online nonstochastic control methodology for mathematical optimization. First, we formalize the setting of meta-optimization, an online learning formulation of learning the best optimization algorithm from a class of methods. The meta-optimization problem over gradient-based methods can be framed as a feedback control problem over the choice of hyperparameters, including the learning rate, momentum, and the preconditioner. We show how recent methods from online nonstochastic control can be applied to develop a convex relaxation, and obtain regret guarantees vs. the best offline solution. This guarantees that in meta-optimization, given a sequence of optimization problems, we can learn a method with performance comparable to that of the best method in hindsight from a class of methods. We end with experiments on a variety of tasks, from regression to deep neural network training, that demonstrate the practical effectiveness of our method.
Causal Inference and Distribution Shift—Hosted by Christina Lee Yu
Friday, October 11, 11:30 a.m. — 12:45 p.m. (Statler Hall, Room 265)
Ayoub Foussoul—Distributionally Robust Newsvendor on a Metric
We study the distributionally robust newsvendor on a metric problem, a fundamental generalization of the distributionally robust newsvendor problem of Scarf (1957), where the decision maker needs to jointly determine the inventory levels at multiple locations on a metric and design an online fulfillment policy for the uncertain demand that realizes sequentially over time. The goal is to minimize the total expected inventory and fulfillment costs. We design a near-optimal policy for the problem with theoretical guarantees on its performance. Our policy generalizes the classical solution of Scarf (1957), maintaining its simplicity and interpretability: it identifies a hierarchical set of clusters, assigns a “virtual” underage cost to each cluster, then makes sure that each cluster holds at least the inventory suggested by Scarf’s solution if the cluster behaved as a single point with “virtual” underage cost and original overage cost. As demand arrives sequentially, our policy fulfills orders from nearby clusters, minimizing transshipment costs, while balancing inventory consumption across the clusters to avoid depleting any single one. In addition to its theoretical performance, numerical experiments show that our policy performs well in practice.
Yihong Gu—Causality Pursuit from Heterogeneous Environments
Pursuing causality from data is a fundamental problem in scientific discovery, treatment intervention, and transfer learning. This paper introduces a novel algorithmic method for addressing nonparametric invariance and causality learning in regression models across multiple environments, where the joint distribution of response variables and covariates varies, but the conditional expectations of outcome given an unknown set of quasi-causal variables are invariant. The challenge of finding such an unknown set of quasi-causal or invariant variables is compounded by the presence of endogenous variables that have heterogeneous effects across different environments, including even one of them in the regression would make the estimation inconsistent. The proposed Focused Adversial Invariant Regularization (FAIR) framework utilizes an innovative minimax optimization approach that breaks down the barriers, driving regression models toward prediction-invariant solutions through adversarial testing. Leveraging the representation power of neural networks, FAIR neural networks (FAIR-NN) are introduced for causality pursuit. It is shown that FAIR-NN can find the invariant variables and quasi-causal variables under a minimal identification condition and that the resulting procedure is adaptive to low-dimensional composition structures in a non-asymptotic analysis. Under a structural causal model, variables identified by FAIR-NN represent pragmatic causality and provably align with exact causal mechanisms under conditions of sufficient heterogeneity. Computationally, FAIR-NN employs a novel Gumbel approximation with decreased temperature and stochastic gradient descent ascent algorithm. The procedures are convincingly demonstrated using simulated and real-data examples.
Reese Pathak—Towards optimal learning under distribution shift: new complexity measures and algorithms for tackling covariate shift
Common machine learning practice implicitly assumes that training data will resemble future data. However, this is often violated, as seen in evolving e-commerce trends, shifting patient populations in healthcare, and changing environments in autonomous driving. Ignoring such distribution shifts can lead to alarming consequences like misguided recommendations, ineffective medical treatments, or risky maneuvers in self-driving cars. In this talk, we describe new advances in learning under a form of distribution shift known as covariate shift. Our main contribution is establishing the (minimax) statistical complexity for prediction and estimation under covariate shift, within the rich framework of reproducing kernel Hilbert spaces (RKHSs). A key tool we develop is a novel, semidefinite programming (SDP)-based complexity measure through which we determine the optimal statistical rate. Our analysis provides concrete recommendations for new, rate-optimal procedures, based on a form of distribution-dependent shrinkage. Unlike previous results in this area, our work is comparatively fine-grained and assumption-light: we impose essentially no restrictions on covariate distributions and require neither the existence nor boundedness of likelihood ratios, thereby broadening the applicability of our theory.
Poster Presenter Information
Poster sessions will be held 8:30am-9:45am Thursday Oct 10 (with sign in) and 8:30am-9:30am Friday Oct 11 in the atrium of Duffield Hall (343 Campus Rd, map here), in concert with breakfast (provided). You can either bring a physical copy of your poster with you, or bring a digital copy on USB drive and pay to print locally at Cornell University’s Mann Library. Posters should be at most36" x 48” (inches).
Transportation in and around Ithaca
Tompkins County Area Transit (TCAT) serves the greater Ithaca area and the Cornell campus. TCAT has a free app you can use to find route information as well as to pay fares. You can learn more and download the TCAT app here.
Lyft and Uber are also available.
Parking on campus
Parking rules and regulations are in effect Mon-Fri 7:30am-5:00pm and some locations have signs that state there are night and weekend restrictions too. If you would like to park on campus, you can pay for parking through the Parkmobile app on your phone. Short-term parking options.
Travel Information:
Airports and Buses
Ithaca Airport: The closest option for flying would be arriving at the Ithaca airport. It is only three miles away from Cornell campus, and an easy taxi/uber/lyft ride to get to and from. Many hotels provide shuttle service to and from the airport as well as around town.
Syracuse Airport: While Syracuse has a lot more flight options, it is a bit over an hour lyft/uber ride to get from Syracuse to Ithaca. Or you can rent a car via this link.
Other nearby airports (less than an hour drive) are Elmira and Binghamton. Both are quite small.
NYC Airports: The last option would be flying into any of the major NYC airports and taking a bus to Ithaca. This is the longest option, as most buses from NYC to Ithaca take about 4.5 hours. Some bus options include:
- Cornell Campus to Campus: Most comfortable option that leaves from the Cornell Club in midtown NYC and drops you off right on Cornell campus in Ithaca.
- OurBus: Has options leaving from either the George Washington Bridge or Port Authority to downtown Ithaca. From downtown it is a short cab ride to get to Cornell.
Ithaca Hotels:
2310 N Triphammer Rd, Ithaca, NY 14850•(607) 257-3100
Courtyard by Marriott Ithaca Airport/University
29 Thornwood Dr, Ithaca, NY 14850•(607) 330-1000
Fairfield Inn & Suites by Marriott Ithaca
359 Elmira Rd, Ithaca, NY 14850•(607) 277-1000
337 Elmira Rd, Ithaca, NY 14850•(607) 277-5500
Hotels in the Greater Ithaca Area:
Best Western Plus Finger Lakes Inn & Suites
3175 Fingerlake East Dr, Cortland, NY 13045•(607) 756-2233
Fairfield by Marriott Inn & Suites Cortland
3707 NY-281, Cortland, NY 13045•(607) 299-4455
26 River St, Cortland, NY 13045•(607) 662-0007
Where the Locals Stay:
518 Stewart Ave, Ithaca, NY 14850•(607) 319-4611
Firelight Camps
1150 Danby Road, Ithaca, NY•(607) 229-1644
Reservations: reservations@firelightcamps.com
Ithaca also has many premier Bed & Breakfasts, Airbnbs, VRBOs.